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1 Introduction 

A graph with vertex set V and edge set E is called a (d, c)- expander if the 
maximum degree of a vertex is d and, for every set W C V of cardinality at 
most \V\/2, the inequality \{w G V \ W : {v,w} G E}\ > c\V\ holds. This 
note considers a related combinatorial question: 

For which integers d and functions fd does there exist, for every 
large enough v, a bipartite (i-regular multigraph on 2v nodes 
with node sets V and W having the following property: For 
every U C V and every U C W, the cardinality of the set of 
neighbours of U is at least fd(\U\)7 

Graphs with the above property seem to behave well also with respect 
to other, more complicated, expansion-type properties. Indeed, the au- 
thor was motivated to study this question by a paper communicated to 
him in May 2002 (the latest version of the paper is available from URL 
http://www-math.mit.edu/~vempala/papers/tspinapprox.ps). In this pa- 
per, Papadimitriou and Vempala established approximation hardness of TSP 
with triangle inequality using as a tool in their construction the fact that 
for d = 6 and 



2u < u < v/4, 

u + v/i v/i<u<v/2, 
u/2 + v/2 v/2<u<v, 



there exist bipartite multigraphs with the properties described in the above 
question. In this paper, we prove the following theorem: 

Theorem 1. For d G {5,6,7,8} and functions fd as described below, there 
exists, for every large enough v, a bipartite d-regular multigraph on 2v nodes 



1 



with node sets V and W having the property that for every U C V and every 
U QW the cardinality of the set of neighbours of U is at least fd(\U\). 



h{u) 



k{u) 



fr(u) 



fs(n) 



(1) 



(2) 



2u 0<u< 3v/20, 

4u/3 + u/10 3v/20 < u < 3u/10, 
u + v/5 3v/10 < u < v/2, 

3n/4 + 13u/40 v/2 < u < 7v/10, 
u/2 + v/2 7v/W <u <v, 

'hu/2 0<u<v/W, 

5u/3 + v/12 v/W < u < v/4, 

u + v/4 v/4<u<v/2, 

3u/5 + 9^/20 v/2 < u < 3v/4, 

2u/h + 3v/h 3v/4<u<v, 

'3u 0<u<u/10, 
2u + v/10 u/10 < u < 3v/20, 

21«/15 + 19u/100 3w/20 < u < 3v/W, 
u + 31u/100 30u/100 < u < 39t;/100, 

15u/21 + 59t;/140 39^/100 < u < 3u/5, 
u/2 + llt;/20 3w/5 < u < 7v/W, 

u/3 + 2v/3 7v/10<u<v. 

3u < u < v/10, 

2u + v/10 v/W<u<v/5, 

5u/4 + v/A v/5<u<v/3, 

4u/5 + 2v/5 v/3<u<v/2, 

u/2 + llv/20 v/2<u<7v/W. 

u/3 + 2v/3 7v/10 < u < v. 



The proof of the theorem relies partly on claims that have been verified 
by a computer. Hence, a second contribution in this paper is the description 
of a fairly general methodology for devising computer-assisted proofs for a 
wide class of mathematical claims. 



(3) 



(4) 



1.1 The probabilistic method 

The probabilistic method j 1 4 is particularly well-suited for proving that there 
exists, in some large class of objects, an object with certain specified prop- 
erties. Typically, the argument proceeds by first selecting a random object 
from the class and then estimating the probability that the selected object 
does not have the sought properties. If this probability can be shown to 
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be strictly less than one, the probability that the selected object has the 
property is strictly positive and hence there exists an object with the sought 
properties. 

The method is highly non-constructive. For our case, we in fact have no 
idea whatsoever how to actually construct a graph with the sought properties 
in time polynomial in the number of vertices in the graph. 

1.2 Computer-assisted proofs 

As mentioned in the previous section, a critical component in the probabilis- 
tic method is to show that the probability of some event is strictly less than 
one. In our case, the function expressing this probability is fairly compli- 
cated, although continuously differentiable almost everywhere. In principle, 
straightforward but tedious analysis of first and second order derivatives of 
the function could be used to prove that it is strictly less than one. We feel, 
however, that the contribution to the community from such a proof is very 
minor. Instead we resort to a computer-assisted proof and argue that this 
method of proof should be accepted in cases like ours. 

First, what is a computer assisted proof of some statement a f(x) < c for 
all x € [a, 6]"? Simply evaluating the function at some points is clearly not 
enough — the function may assume other, dangerous, values at the points 
where it was not evaluated. Given some bound, proven by a conventional 
mathematical proof, of the form < C for all x € [a, &]", we could 

argue that it is enough to evaluate the function at points that are sufficiently 
close since a Taylor expansion then bounds the value of the function at all 
points. However, floating point computations done by computers are not 
accurate, and the latter argument above fails to take into account possible 
influences of round-off errors. 

A solution to the problem of round-off errors is to use interval arith- 
metic 2 a . The main idea behind computations with interval arithmetic is 
to compute not with single numbers but rather with intervals. When some 
function / is applied to some interval I, the result is an interval that contains 
f(x) for every x G /. Hence, interval arithmetic is particularly well suited for 
verifying claims that are of the form "/(x) < c for x £ /". If the interval / 
is large, then the result of computing /(I) is usually also a large interval; in 
particular, I could also contain c. The solution to this problem is to split / 
into sub-intervals that are sufficiently small and then compute f{Ik) for each 
sub-interval The computer-assisted part of our proofs works precisely in 
this way and was inspired by Uri Zwick's work on optimal approximation 
algorithms for certain constraint satisfaction problems _4. 

We argue that computer-assisted proofs should be accepted in cases like 
ours. To examine and ascertain the correctness of the program presented 
in this paper is the same thing as examining a conventional mathematical 
proof. It is, of course, true that the correctness of the computer-assisted 
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verification of the claims that the program verifies, requires the assumption 
that the program is correctly compiled and that all used library routines 
are correctly written. It is in principle possible to construct a compiler 
that automatically includes logging facilities in the executable program; the 
output from such a system could then in principle be verified step by step 
to ascertain the correctness of the computation. 



2 Proof of Theorem [T] 

We select a ci-regular bipartite multigraph on 2v vertices by selecting one per- 
fect matching in a bipartite graph on dv + dv vertices uniformly at random. 
From this perfect matching, the <i-regular bipartite graph is constructed 
by identifying groups of d vertices in the "big" bipartite graph with single 
vertices in the sought (i-regular bipartite graph. 

The analysis of the construction proceeds by estimating the probability 
that such a randomly chosen graph does not have the desired properties, 
i.e., the probability that there is some U C V or some U such that the 
cardinality of the set of neighbours of U is no more than fd(\U\). 

Fix a set U C V of size u and a set N C W of size n. When u > n, 
the pigeon-hole principle implies that U cannot have neighbours only in N. 
When u < n, the probability that U has neighbours only in N is 

dn\ I (dv 
duj I \du 

since there are in total ways to choose the neighbours of U and (j™) 
of those choices result in neighbours only in N. It turns out that we need 
different methods to bound this probability depending on how close (u, n) 
is to the "extreme points" of a certain region. Therefore, we define 

O = {(u,n) :u<n< f d (u)}, 

ri<5 = {(u, n) G CI : min-ftx, v — n} < 5v}, 

n' = q \ n 6 . 

With this notation, the probability that there is some U Q V or some 
U C W such that the cardinality of the set of neighbours of U is no more 
than fd(\U\) can be written 

(u,n)en U&V N£W 
\U\=u \N\=n 



duj \du 



where the factor 2 above comes from the fact the we consider not only 
neighbour sets of U C V but also neighbour sets of XJ C W. Let 

P(u, n) = Q Q Q J I Q J = Q Q ^n-du)\{dv)\ ' (5) 
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Using P and the O's, the probability that there is some U Q V or some 
U C W such that the cardinality of the set of neighbours of U is no 
more than fd(\U\) can be upper bounded by 2| | max^^g^ P(u, n) + 
2|0 / |max (Uin)en / P(u,n). 

As can be seen from Figure 2\Qs\ < 2Svf(5v) and 2|0'| < v 2 . The 
proof now proceeds by setting 5 = 10~ 5 and then proving that 

26vf(6v) max P(u,n) < 1/2, (6) 

t> 2 max P(u,n) < 1/2. (7) 

(u,n)G!)' 

This is enough to complete the proof, since in that case the probability that 
a randomly selected graph has the desired properties is non-zero and hence 
there exists at least one graph with the desired properties. 



2.1 Analysis close to extreme points 

Since both P and our functions fa for d G {5, 6, 7, 8} are symmetric with 
respect to reflection around the line u + n = v, it is enough to consider pairs 
(u, n) G Clg such that u + n < v. For fixed u < 5v, 

P(u,n + 1) (v — n) -i^j- dn + i 

P(u, n) (n + 1) dn — du + i 

therefore P(u,n) is increasing in n. Hence, it suffices to bound P(u, fa(u)). 
The following lemma establishes a slightly more general result. 

Lemma 1. For every integer u such that 1 < u < I0~ 5 v and every (k,d) G 
{(2, 5), (3, 6), (3, 7), (3, 8)}, 2k5 2 v 2 P(u,ku) < 1/2 where P is defined by 0J. 



Proof. The proof is by induction on u. The base case is clear since 



! fc! ' (v - l) c 



For d > 3 + £; and t> large enough, the latter expression above is strictly less 
than 1/2. For the cases when d = 3 + k, the latter expression is less than 

10- 10 • 3 7 • —^—j < 1/2. 

(v - l) d 1 

For the inductive step, we show that P(u, ku)/P{u + 1, ku + k) > 1. Since 

p{uM) 0( k i)( k d d :)/( dv > 



du) 



P(u + l,ku + k) ( Mf, v )(kdu+kd\/(dv \ 
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Figure 1. The level curve Q(u, n) = 1 and the function fd plotted for d = 8 
with solid lines. The lower picture shows half of the set £1$. 
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we need estimates of the following form: 

v\ I ( v \ u + 1 u + 1 

> 



u J / \u + 1 J v — U V 

k 



_ nti(^ + > {ku + l) k (u + l) k 
kuj I \ku + kj ~ nto( u ~ku-i) > {v- ku) k > v k 



dv \ / (dv\ {dv-du-d) d (v-2u) d 
du + d) I \du) > (du + d) d > {u + l) d ' 
(kdu\ I fkdu + kd\ (du) d ({k - l)duf k ^ d _ (k - l)^- 1 )** 



u kd 



\du J / \ du + d J " (kdu + kd) kd k kd (u + l} kd 

Put together, the above bounds imply that 

P(u,ku) (v-2u) d {k-l)( k -^ d u kd 



P(u + l,ku + k) v k+1 (u + l) d - k - 1 k kd (u + l) kd 

For v > 5 2 and u such that 1 < u < 5v, this is greater than 

(1 - 25) d (k - 5 kd 
(5 + 5 2 ) d - k - 1 W d (5 + 5 2 ) kd 

>S-( d -^\l-(kd + 3d)6) { ±^- d . 

For (k,d) = (2,5) the above ratio is at least 10 10 2~ 10 (1 - 255) > 1. For 
k = 3 the ratio is at least io M ~ 20 2 M 3~ M (l — 6d5), which is strictly greater 
than one for the d considered. ■ 

2.2 The interior region 

To bound v 2 P(u, n) in $7', write u = av and n = j3v and apply Stirling's 
formula 

( V ) =(a~ a (l-a)-^) v poly(v) 
\av J 

to P(u,n) defined in ©: 

V P{aV ' H = U a (l-/3)^(/5-«)^)J P01J(V) - 

Note that the above expression is valid also for u = n, i.e., also for a = /3, if 
we use the convention that 0° = 1. By the symmetry of the function P and 
the results obtained in § 12.11 it is enough to consider pairs (a,/3) in the set 

A = {(a, (3) : /3-a>0Af3v < f d (av) A a > 10" 5 A (3 < 1 - 1(T 5 }. (8) 
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Hence, it is sufficient to prove that there exists a universal constant c < 1, 
strictly bounded away from 1, such that 

(I _ a )(d-l)(l-a) g(d-l)/3 

To this end, we first show that it is enough to consider the boundaries of A, 
i.e., the points where (5v = fd(av), and then analyze the function Q on those 
line segments. 

2.2.1 It is enough to consider the boundary 

To achieve the first goal, we prove that InQ is convex along lines of the 
form (3 — a = y for non-negative y. This amounts to substituting, e.g., 
a = (1 + x — y)/2 and (3 = (1 + x + y)/2 in the expression for Q and then 
consider the resulting expression as a function of x for arbitrary fixed y. 

Lemma 2. For every fixed j/G [0,1-2/4 the function 

q(x; y) : x i-> In Q((l + x - y)/2, (1 + x + y)/2) 

is convex in the interval 
d(l-y)-2 



x < 



d-2 



Proof. Straightforward substitution shows that q(x; y) = — dy(ln2 + lny) — 
(d — 2) In 2 + \g(x\ y) where 

g(x;y) = (d-l)(l + x + y)ln(l + x + y) + (d-l){l-x + y)ln(l-x + y) 
- (1 + x — y) ln(l + x - y) - (1 — x — y) ln(l - x — y). 

Hence the derivative of q with respect to x is 

n s i , ni l + x + y l + x-y 

g {x; y) = [d- 1 In ■ In 

1 — x + y 1 — x — y 

and the second derivative is, consequently, 

(d-l)(2 + 2y) 2-2y 



g"(x;y) 



(l + x + y)(l-x + y) (l-x-y)(l + x-y) 

We now rewrite the second derivative as 

= (d-2)(l-x 2 -^)-^(l+x 2 -y 2 ) 
9 {X,V) (l+x + y)(l-x + y)(l-x-y)(l + x-y) 

and obtain that g"(x;y) is non- negative when 
(d - 2)(1 - x 2 - y 2 ) >dy(l + x 2 -y 2 ), 
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or, equivalently, 



x \d{l+y)-2) < (d(l-y)-2)(l-y 2 ). 

It is now straightforward to see by substitution that the above inequality is 
satisfied as soon as \x\ < (d(l — y) — 2)/(d — 2) and y G [0, 1 — 2/d]. Finally, 
we remark that the same result is valid also for the case when y = 0: q{x; 0) 
is convex for \x\ < 1. ■ 

To summarize, the function q considered in the above lemma is con- 
vex along lines parallel to the x-axis inside a triangle with corners (x, y) G 
{(1, 0), (— 1, 0), (0, 1 — 2/d)}. Translated to (a, /3)-coordinates, the lemma 
therefore implies that it is enough to bound the function Q(a,/3) on the 
boundaries of A as soon as il' is contained inside the triangle with corners 
(a, (5) G {(1, 1), (0,0), (1/ti, 1 — For our case, it can be seen without 

much ado that Vt' is indeed contained in this triangle: the functions fa are 
piecewise linear and the slopes of the line segments are all strictly less than 
the slope of the line from (0,0) to (l/d, 1 — 1/d). 

2.2.2 Bounding the function on the boundary 

Since the function Q(a,/3) is symmetric with respect to reflection around 
the line a + /3 = 1, it is sufficient to prove that Q(a,(3) is strictly less than 
one on the first three "legs" of /s, the first three "legs" of fa, the first four 
"legs" of fi, and the first three "legs" of fs- In principle, this can be done 
by substituting (3 = /d(a) and then analyzing the resulting function using 
calculus. Since this is extraordinarily tedious — both for the author and for 
the reader — we instead choose a computer-assisted method of proof using 
interval arithmetic |2J. Specifically, we present a computer program that 
verifies the following claims: 

Claim 1. For d = 5 and a G [10" 5 ,0.5], Q(a, Ifc(av)) < 0.9999 where Q 
is defined by © and f 5 by (JTJ). 

Claim 2. For d = 6 and a G [10" 5 ,0.5], Q(a, ~/ 6 (cra)) < 0.9999 where Q 
is defined by © and / 6 by ©. 

Claim 3. For d = 7 and a G [10~ 5 ,0.39], Q{a, lh{av)) < 0.9999 where Q 
is defined by © and / 7 by ©. 

Claim 4. For d = 8 and a G [10~ 5 ,0.34], Q(a, Ifs(av)) < 0.9999 where Q 
is defined by © and / 8 by (gj). 

There is built-in support for interval arithmetic in some compilers for 
some programming languages. The author has constructed a C++ program 
that uses the interval arithmetic routines built into Sun One Studio 7 [3]. 
Complete source code of the program is given in the appendix. Most of it 
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is self-explanatory, the exception being, maybe, the recursive function that 
verifies that some function f(x,y) is strictly less than some given upper 
bound b along the line y = y(x): 

bool check(const Af fineFunctionft y, const di& i) { 
if(clt(f (i,y(i)), b)) { 

log « i « ": " « sup(f (i,y(i))) « std::endl; 
return true ; 

} 

else { 

return check(y, interval_hull(di (inf (i) ) , di (mid(i) ) ) ) && 
check(y, interval_hull(di(mid(i)) , di (sup(i) ) ) ) ; 

> 

} 

This function computes an interval I containing {f(x,y(x)) : x € i}. If that 
interval is certainly less than an interval b that contains the desired upper 
bound, i.e., if Vz € I\/w E b[z < w], the claim has been verified on the 
interval i. Otherwise, the verification proceeds recursively: Two intervals 
that together cover i are constructed, and the verification continues on those 
intervals. A transcript of the verified subintervals is written to a log file. 

When compiled with Sun One Studio 7 and executed on a Sun worksta- 
tion, the program verified Claims in a couple of seconds. The number 
of subintervals used were 332, 391, 857, and 261, respectively. The only 
option given to the compiler was "-xia" , which enables support for interval 
arithmetic. 

3 Conclusions 

The methods described in this paper are fairly general and can be applied 
to larger values of d without any complications. For smaller d, the analysis 
of the function P close to the extreme points needs to be adapted. 

Our functions fd are all symmetric along the line u+n = v. This property 
follows from our use of the probabilistic method; the involved probabilities 
have the same symmetry. One possible direction for future work could be 
to improve the behaviour of the functions fd in the region where u is close 
to v. Indeed, for large d, one would expect that the neighbour set of any 
0.99i> vertices is the entire other side of the bipartite graph. 
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A Source code 



1 #include <fstream> 

2 #include <suninterval .h> 

3 typedef SUNW_interval : : interval<double> di; 

4 

5 class Q 

6 { 



7 const di d; 

8 const di one; 

9 public: 

io Q(const int degree) : d(degree) , one("[l]") {} 

n di operator () (const difc alpha, const dift beta) const { 

12 return (pow(one-alpha, (d-one) * (one-alpha) ) * pow(beta, (d-one) *beta) ) 

13 / (pow(alpha, alpha) * pow(one-beta, one-beta) 

14 * pow (beta-alpha, d* (beta-alpha) )) ; 

15 } 

16 }; 

17 

18 class Af f ineFunction 

19 { 

20 const di k; 

21 const di m; 

22 public: 

23 Af f ineFunction(const di& slope, const di& offset) 

24 : k(slope) , m(off set) {} 

25 Af f ineFunction(const difc xO, const di& yO, const di& xl, const di& yl) 

26 : k((yl-y0)/(xl-x0)) , m(y0-k*x0) {} 

27 di operatorO (const difc x) const { return k*x+m; } 

28 }; 

29 

30 class Segment 

31 { 

32 const di i; 

33 const Aff ineFunction f; 

34 public: 

35 Segment (const di& preimage, const difc slope, const difc offset) 

36 : i (preimage), f (slope, offset) -Q 

37 Segment (const difc xO, const di& yO, const di& xl, const dife yl) 

38 : i(interval_hull(xO,xl)) , f(x0, yO, xl, yl) -Q 

39 const dift preimage () const { return i; > 

40 const Af f ineFunctionfc functionO const { return f; } 

41 }; 

42 

43 class Checker 

44 { 

45 const di b; 

46 const Q& f ; 

47 std: : of stream log; 

48 bool check(const Af f ineFunctionft y, const dift i) { 

49 if (clt(f (i,y(i)) , b)) { 

50 log « i « ": " « sup(f (i,y(i))) « std::endl; 

51 return true; 

52 } 
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53 else { 

54 return check(y, interval_hull (di (inf (i) ) , di (mid(i) ) ) ) && 

55 check(y, interval_hull (di (mid(i) ) , di (sup(i) ) ) ) ; 

56 } 

57 > 

58 public: 

59 Checker (const di& bound, const Q& fun, const char *file) 

60 : b(bound), f(fun), log(file) -Q 

61 bool operator () (const Segment& s) { 

62 return check(s . f unctionO , s .pre image ()) ; 

63 > 

64 }; 

65 

66 int main() 

67 { 

68 const di bound (" [0.9999] ") ; 

69 // Verify claim for d==5 

70 Checker C5(bound, Q(5), "Q5.txt"); 

71 Segment L51(di(" [le-5,0. 15] ") , di("[2]"), di("[0]")); 

72 Segment L52(di(" [0.15]"), di(" [0.30] ") , di(" [0.30] ") , di(" [0.50] ")) ; 

73 Segment L53(di(" [0.30] ") , di(" [0.50] ") , di(" [0.50] ") , di(" [0.70] ")) ; 

74 if(C5(L51) && C5(L52) && C5(L53)) { 

75 std::cout « "Claim is true for d==5." << std::endl; 

76 } 

77 // Verify claim for d==6 

78 Checker C6 (bound, Q(6), "Q6.txt"); 

79 Segment L61(di(" [le-5,0. 10] ") , di("[2.5]"), di("[0]")); 

so Segment L62(di(" [0. 10] ") , di(" [0.25] ") , di(" [0.25] ") , di(" [0.50] ")) ; 

81 Segment L63(di(" [0.25] ") , di(" [0.50] ") , di(" [0.50] ") , di(" [0.75] ")) ; 

82 if(C6(L61) && C6(L62) && C6(L63)) { 

83 std::cout << "Claim is true for d==6." « std::endl; 

84 } 

85 // Verify claim for d==7 

86 Checker C7 (bound, Q(7), "Q7.txt"); 

87 Segment L71(di(" [le-5,0. 10] ") , di("[3]"), di("[0]")); 

88 Segment L72(di(" [0. 10] ") , di(" [0.30] ") , di(" [0. 15] ") , di(" [0.40] ")) ; 

89 Segment L73(di(" [0. 15] ") , di(" [0.40] ") , di(" [0.30] ") , di(" [0.61] ")) ; 

90 Segment L74(di(" [0.30] ") , di(" [0.61] ") , di(" [0.39] ") , di(" [0.70] ")) ; 

91 if(C7(L71) && C7(L72) && C7(L73) && C7(L74)) { 

92 std::cout << "Claim is true for d==7." << std::endl; 

93 } 

94 // Verify claim for d==8 

95 Checker C8 (bound, Q(8), "Q8.txt"); 

96 Segment L81(di(" [le-5,0. 10] ") , di("[3]"), di("[0]")); 

97 Segment L82(di(" [0.10] ") , di(" [0.30] ") , di(" [0.20] ") , di(" [0 . 50] ") ) ; 

98 Segment L83(di(" [0.20,0.34] ") , di(" [1 . 25] ") , di(" [0.25] ")) ; 

99 if(C8(L81) && C8(L82) && C8(L83)) { 

100 std::cout << "Claim is true for d==8." << std::endl; 

101 } 

102 > 
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